Analysis of Data Mining Techniques on Real Estate
Authors: Kanak Saxena
Abstract
Data mining techniques are broadly classified into two classes: (i) statistical techniques and (ii) knowledge discovery. The continuing rapid growth of online data and the widespread use of databases necessitate the development of techniques for extracting useful knowledge and for facilitating database access. This paper analyzes the results of a multilayer perceptron and of pace regression and suggests a very efficient pattern that can prove beneficial for knowledge discovery. The analysis uses a real estate data set containing 5821 tuples and 43 attributes and finds that, in the Indian scenario, a person's demographic details play a very prominent role in identifying a customer's investment behavior. In the multilayer perceptron model, the input layer is followed by two hidden layers. The first hidden layer contains 21 nodes, according to the weightage of the various attributes, followed by a second hidden layer that assigns re-processed weights to each of the 21 nodes. If the demographic details are discarded, the resulting model consists of 13 sigmoid nodes, and there is a major change in error rate and correlation. We used WEKA for the analysis and found that, in general, multilayer perceptron (selected) is more efficient than pace regression (complete) in terms of statistical methods, but in the Indian context pace regression (complete) is more efficient than multilayer perceptron (selected).
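The network shape described in the abstract can be illustrated with a minimal sketch in plain Python. It is not the WEKA implementation used in the paper. The input width of 42 (43 attributes minus one assumed target), the size of the second hidden layer (assumed to be 21 nodes), the random initial weights, and every function name are assumptions made for illustration only. The two helper metrics mirror the kind of error-rate and correlation figures the abstract mentions.

```python
import math
import random


def sigmoid(x):
    """Logistic activation used by the hidden nodes."""
    return 1.0 / (1.0 + math.exp(-x))


def make_layer(n_in, n_out, rng):
    """Create n_out nodes, each holding n_in weights plus a trailing bias."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
            for _ in range(n_out)]


def build_network(n_inputs, hidden_sizes, rng):
    """Stack the hidden layers and finish with a single linear output node."""
    sizes = [n_inputs] + list(hidden_sizes) + [1]
    return [make_layer(sizes[i], sizes[i + 1], rng)
            for i in range(len(sizes) - 1)]


def forward(layers, x):
    """Forward pass: sigmoid hidden nodes, linear output for regression."""
    activation = list(x)
    for index, layer in enumerate(layers):
        is_output = index == len(layers) - 1
        next_activation = []
        for node in layer:
            z = node[-1] + sum(w * a for w, a in zip(node[:-1], activation))
            next_activation.append(z if is_output else sigmoid(z))
        activation = next_activation
    return activation[0]


def correlation(actual, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    dev_a = math.sqrt(sum((a - mean_a) ** 2 for a in actual))
    dev_p = math.sqrt(sum((p - mean_p) ** 2 for p in predicted))
    return cov / (dev_a * dev_p)


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Assumed shape: 42 inputs, two hidden layers of 21 nodes, one output.
rng = random.Random(0)
network = build_network(42, [21, 21], rng)
sample = [rng.random() for _ in range(42)]  # synthetic input, not real data
prediction = forward(network, sample)
```

Dropping a group of attributes, as the abstract does with the demographic details, would correspond here to calling `build_network` with a smaller input width and different hidden sizes, then comparing `correlation` and `mean_absolute_error` across the two models.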
Similar resources
Analysis of Missing Value Estimation Algorithms for Data Farming
In this paper we compare various statistical methods for estimating missing data values. Missing data estimation is a part of data farming. Data farming is a process that grows the data, provides a more comprehensive understanding of the possible outcomes, and offers the opportunity to discover outliers and surprises. Many times data mining tasks use existing data collected for various other purpose...
A comparative analysis of Bayesian methods for Real Estate domain
Bayesian classifier has gained wide popularity as a probability-based classification method despite its assumption that attributes are conditionally mutually independent given the class label. This paper makes a study into various algorithms to improve the classification accuracy of Bayesian methods with respect to real estate datasets. We have applied Bayesian methods on two variations of data...
Identifying Buying Preferences of Customers in Real Estate Industry Using Data Mining Techniques
*M.Tech. Scholar, Amity University, Noida **Asst. Professor, Amity University, Noida ABSTRACT With an enormous amount of data stored in databases and data warehouses, it is increasingly important to develop powerful tools for analysis of such data and mining interesting knowledge from it. Data mining is a process of inferring knowledge from such huge data. The main problem related to the retrie...
An Efficient Classification Algorithm for Real Estate domain
Classification rule mining aims to discover a small set of rules in the database that forms an accurate classifier. In classification rule mining there is one and only one predetermined target. In this paper, we proposed an algorithm, which performs preprocessing and cleaning prior to traditional classification. Experimental results show that the classifier built this way is, in general, more a...
A Way to Understand Various Patterns of Data Mining Techniques for Selected Domains
This has much in common with traditional work in statistics and machine learning. However, there are important new issues which arise because of the sheer size of the data. One of the important problems in data mining is classification-rule learning, which involves finding rules that partition given data into predefined classes. In the data mining domain where millions of records and a large n...